Two-Variance-Component Model Improves Genetic Prediction in Family Datasets.

نویسندگان

  • George Tucker
  • Po-Ru Loh
  • Iona M MacLeod
  • Ben J Hayes
  • Michael E Goddard
  • Bonnie Berger
  • Alkes L Price
چکیده

Genetic prediction based on either identity by state (IBS) sharing or pedigree information has been investigated extensively with best linear unbiased prediction (BLUP) methods. Such methods were pioneered in plant and animal-breeding literature and have since been applied to predict human traits, with the aim of eventual clinical utility. However, methods to combine IBS sharing and pedigree information for genetic prediction in humans have not been explored. We introduce a two-variance-component model for genetic prediction: one component for IBS sharing and one for approximate pedigree structure, both estimated with genetic markers. In simulations using real genotypes from the Candidate-gene Association Resource (CARe) and Framingham Heart Study (FHS) family cohorts, we demonstrate that the two-variance-component model achieves gains in prediction r(2) over standard BLUP at current sample sizes, and we project, based on simulations, that these gains will continue to hold at larger sample sizes. Accordingly, in analyses of four quantitative phenotypes from CARe and two quantitative phenotypes from FHS, the two-variance-component model significantly improves prediction r(2) in each case, with up to a 20% relative improvement. We also find that standard mixed-model association tests can produce inflated test statistics in datasets with related individuals, whereas the two-variance-component model corrects for inflation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved customer choice predictions using ensemble methods

In this paper various ensemble learning methods from machine learning and statistics are considered and applied to the customer choice modeling problem. The application of ensemble learning usually improves the prediction quality of flexible models like decision trees and thus leads to improved predictions. We give experimental results for two real-life marketing datasets using decision trees, ...

متن کامل

Combining Neural Network with Genetic Algorithm for prediction of S4 Parameter using GPS measurement

  The ionospheric plasma bubbles cause unpredictable changes in the ionospheric electron density. These variations in the ionospheric layer can cause a phenomenon known as the ionospheric scintillation. Ionospheric scintillation could affect the phase and amplitude of the radio signals traveling through this medium. This phenomenon occurs frequently around the magnetic equator and in low latitu...

متن کامل

A Random Forest Classifier based on Genetic Algorithm for Cardiovascular Diseases Diagnosis (RESEARCH NOTE)

Machine learning-based classification techniques provide support for the decision making process in the field of healthcare, especially in disease diagnosis, prognosis and screening. Healthcare datasets are voluminous in nature and their high dimensionality problem comprises in terms of slower learning rate and higher computational cost. Feature selection is expected to deal with the high dimen...

متن کامل

Intelligent prediction of heating value of coal

The gross calorific value (GCV) or heating value of a sample of fuel is one of the important properties which defines the energy of the fuel. Many researchers have proposed empirical formulas for estimating GCV value of coal. There are some known methods like Bomb Calorimeter for determining the GCV in the laboratory. But these methods are cumbersome, costly and time consuming. In this paper, m...

متن کامل

Incorporating published univariable associations in diagnostic and prognostic modeling

BACKGROUND Diagnostic and prognostic literature is overwhelmed with studies reporting univariable predictor-outcome associations. Currently, methods to incorporate such information in the construction of a prediction model are underdeveloped and unfamiliar to many researchers. METHODS This article aims to improve upon an adaptation method originally proposed by Greenland (1987) and Steyerberg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • American journal of human genetics

دوره 97 5  شماره 

صفحات  -

تاریخ انتشار 2015